Special Relativity

In a Galilean transform, only the time basis vector is shifted. In a Lorentz transformation, both time and space basis vectors are shifted. This is a direct consequence of the observation that the concept of "simultaneous events" depends on the reference frame. Furthermore, the lengths of the time and space basis vectors are also changed (relative to the first frame), reflecting the observations of time dialation and Lorentz contraction. The exact details behind these transformations will fall straight out of the geometric algebra for relativity.

Algebraic Framework

The fundamental thing we need for starting out is the Minkowski metric (we follow the mostly-minus convention because its cooler or something), which is given (for 3+1 spacetime) by:

This could easily be extended to any number of (spatial) dimensions but… that's a bit scary. To package this into an algebraic framework, we can work in the geometric algebra (or for general spatial dimensions, the algebra is still the same) called the Spacetime Algebra (STA).

Tensor index notation has a similar convention to what you would find in the wild: greek indices represent all possible values (0, 1, 2, 3) but latin indices represent only nonzero values (1, 2, 3).

The basis blades we have here are:

With the tour of the basis blades out of the way, we can now move on to doing actual things with these now. We will first visit the properties of the algebra before turning to relativity stuff.

Timelike vs Spacelike

There is an important distinction to be made regarding the squares of certain basis blades: Basis blades squaring to are said to be spacelike, and ones squaring to are said to be timelike. So revisiting our earlier tour of the basis blades:

As the name suggests, timelike blades have something to do with time and spacelike blades have something to do with space. For general multivectors, timelike refers to having positive square, spacelike refers to having negative square, and lightlike or null refers to having 0 square.

Reciprocal Basis

The reciprocal basis pops up whenever you mix Geometric Algebra with calculus because… people like putting superscripts wherever possible I guess.

The reciprocal basis for STA is defined by the equation

In terms of down-to-earth tangible computable things, we have: and , ie.

Spacetime Splits

A nice property of STA is that its even subalgebra is isomorphic to the Algebra of Physical Space (APS) . The spacetime split coupled with this isomorphism allows us to separate objects in STA into their temporal part as a scalar and their spatial part as a vector in APS.

The details of the isomorphism are:

The spacetime split itself is simply just right-multiplication by ¹ ; to split a spacetime vector , you simply compute . Abbreviating the STA bivectors as , we have for a general spacetime vector ,

And noting that these correspond exactly to the basis vectors of APS under the aforementioned isomorphism, this cleanly shows a vector being split into a scalar temporal part and a vector spatial part.

But these components are not absolute (there is no absolute time or space). Instead, this splits the vectors in the reference frame of an observer whose temporal basis vector is . If another observer has temporal basis vector , then performs a spacetime split in their reference frame.

Spacetime Events and Spacetime Interval

Events in spacetime are represented by vectors in STA. One can specify a temporal part (scalar), and a spatial part (vector), and by inverting spacetime splits, it can be seen that this data maps to a vector in STA.

It also makes sense to ask what is the Lorentz-invariant notion of distance between events. This is what the spacetime interval is, and is very straightforwardly given to us from the algebra. It is simply the squared length as computed from the inner product or squaring the vector, so for 2 events , the spacetime interval between them is . For a single event , this can be used to calculate its Lorentz-invariant squared length as . So the spacetime interval is just the squared length of the event that represents the difference of 2 events.

Note that although the quantity is being squared, the value itself can be positive, zero, or negative. What this means for the length in each case is:

Lorentz Transformations

I will start out with a bit of a spoiler: all Lorentz transformations are just rotations (in STA). This is why I hinted earlier that the bivectors help generate all Lorentz transformations, the same formula for rotations in Geometric Algebra help produce the formulas for Lorentz transformations.

Lorentz transformations encompass 2 types of transformations: Lorentz boosts and normal euclidean rotations. Lorentz boosts are generated from timelike bivectors, and normal euclidean rotations are generated from spacelike bivectors. Everyone knows how to rotate in euclidean space, so I focus on Lorentz boosts here.

For simplicity, assume we want to boost in the direction of . The the rotor for boosting in this direction is simply given by rotation by units in the plane:

Since we know , a slight twist on the Euler identity gives us an equivalent expression in terms of hyperbolic trigonometric functions

We can now investigate the action of Lorentz boosts on events. By linearity, we can just focus on its action on the basis vectors, which is split up into 3 cases.

To have tangible results for a Lorentz boost of velocity , we set where . With by simple properties of hyperbolic trigonometric function [citation needed], it is easy to see that:

Here is called the Lorentz factor.

These completely define the action of a Lorentz boost in direction on any spacetime event. By extension, you can figure out the action of a Lorentz boost in any direction on spacetime events (by simply composing with a euclidean rotation).

Squared length is Lorentz-invariant

A short proof that squared length is actually Lorentz-invariant for the doubters out there.

Let be any rotor (can include euclidean rotations along with Lorentz boosts), and a spacetime event. We have to prove that :

Speed of light is constant

Using this, we can also demonstrate that the speed of light is constant in all (inertial) reference frames. WLOG we can, as before, take the rotor to be . The worldline of light contains all vectors s.t. .

It then becomes a trivial corollary of the above result that , so this is also on the worldline of light.

So we proved the main postulate of Special Relativity using… Special Relativity? Yeah I lied, and this is where we discuss about the Minkowski metric and Spacetime Algebra. Spacetime Algebra is straightforwardly derived from the Minkowski metric. But the Minkowski metric itself is not given from thin air, it actually comes naturally from the postulate "the speed of light is constant in all inertial reference frames." So we just used the postulate to derive the postulate (it's a good sanity check though).

Time Dilation & Lorentz Contraction

Time Dialation

Time dialation is the result of measuring the temporal component of a vector with different bases from different reference frames. To illustrate this with an example, start with a purely temporal event and the rotor for our boost being . Labelling the new basis with , we can derive the following relation:

Then our purely temporal event is now measured to have components

(Note: this can also be derived from the action of on ).

Hence we see a different component in the new basis: , the effect of time dialation.

Lorentz Contraction

Lorentz contraction is a bit different. Not only are you measuring with a different basis, but you are also measuring a slightly different vector.

To see why, we start with the simple example of a plank with endpoints at time being and (so that the plank has length ). This plank's endpoints' worldlines trace out 2 parallel lines in its rest frame.

Now let's perform a Lorentz boost in the direction. Our new coordinate system has bases , and the equations to convert into these is:

And hence our very same plank's endpoints now have components

To see the problem, let's focus on the situation at the first instant, ie. put . We note the components of each endpoint in both the original and boosted coordinate frames:

The problem is that the change in frame of reference has also caused a change in the (hyper)plane of simultaneity. Since distances have to be measured in a (hyper)plane of simultaneity, we could use and as reference points for measuring the length of the plank in the original frame of reference. But in our new frame of reference, these 2 events are not simultaneous anymore, so our reference points need to be changed.

Say we take the reference points to be and where so that . It is easy to see that , so we have the equation

By just equating the component that matters, we see that

Which now properly demonstrates Lorentz contraction.

4-Vectors

Proper Time

It's about time we give time a proper treatment. Proper time was defined before but now we explore what it actually means.

Consider some observer in uniform motion (relative to us) starting at the origin and stopping at some point . In our reference frame, the time of this journey can be extracted from the component of the event . More precisely, where is the time taken in the journey relative to us.

But ofcouse, the observer disagrees. In their reference frame the journey took time (specifically, where is the relative velocity between us and the observer). Everyone can disagree on how much time this journey took in their own reference frame (and still be equally right!), but no one can disagree on how much time the journey takes in the reference frame of the observer taking part in the journey itself. That is, the time taken in the journey from the reference frame of the journey is Lorentz-invariant, it does not depend on which reference frame you try to ask the question from.

This is proper time, the time taken in a journey from the reference frame of the journey. Since it is Lorentz-invariant, we have a straightforward Lorentz-invariant formula to calculate it too: .

Using this, we can now give a defintion to an inertial reference frame. An inertial reference frame is one such that the derivatives of its basis vectors wrt. proper time is 0:

Table of 4-Vectors

Euclidean Vector (3-Vectors) Minkowski Vectors (4-Vectors)
Position
Velocity
Momentum
Acceleration
Force

Specific 4-vectors have an easy relation with the corresponding 3-vectors.

Non-Inertial Frames

Special Relativity can infact handle non-inertial reference frames. The downside is weird things occur for non-inertial observers. Notably:

To get a feel for how weird it is, we do a "short" analysis on one particular case of a non-inertial reference frame.

Rindler Coordinates

Setup

The observer we will be observing has constant proper acceleration. This takes a bit of explaining…

Consider the state of the observer at some frozen instant of time. In their reference frame with coordinate bases , their 4-velocity at this instant is

so it is purely temporal. By the earlier result that , we conclude that their 4-acceleration in this coordinate system is purely spatial

It then follows that the squared length of their 4-acceleration is . This is their (negative of squared) proper acceleration. The condition of constant proper acceleration requires that this value stays constant at all times (it is a Lorentz-invariant scalar so it doesn't matter who measures it to be constant).

Analysis

For the sake of simplicity, we assume the spatial acceleration in the frame of the observer is purely in the direction. Hence in our frame, will have nonzero components in only the basis. Applying the law that , we get that

Where in the last step, the positive branch is enforced to make acceleration in the direction. The other conversions were done using the componentwise expansion of and . A similar lengthy calculation shows that . Since we are an inertial reference frame, this leads us to a set of coupled differential equations

The general solution (for just for now) is

To fix some values of , we need to pick an initial condition and then apply a law we've derived before. The initial condition will be to enforce which gives us the restriction , and our equation reduces to

From this we derive that

Now we apply at to obtain . This simplifies both our equations to

At this point, the rest is free. Just perform simple integration to get the components of the 4-position of the observer.

The constants of integration can be arbitrary in this case, as they represent the starting point of the journey. To make things simple, we set , which amounts to .

The Rindler Horizon

We have proven an interesting fact about constant accelerated motion: that its worldline in spacetime is a hyperbola. Hence this motion is asymptotic to some straight worldline. It just so happens that this motion is asymptotic to the worldline of a light beam travelling in the direction passing through our origin:

This gives rise to an interesting phenomenon: Any information from us transmitted to this observer after will never reach the observer (unless they stop accelerating).

This line is an artifical "event horizon" called the Rindler Horizon. The observer cannot see anything cross this horizon from their perspective. In the perspective of the observer, the Rindler Horizon is always at a constant proper distance of behind them, called the Rindler Distance.

Rindler Coordinate System

Our next task would be to figure out the coordinate system this observer would be using. As noted earlier, it will be a curvilinear coordinate system:

This gives us the formulas to convert between components of a 4-position expressed in our basis and in the Rindler coordinate basis:

so that for a given 4-position , we have

(here the index is fixed, no Einstein summation convention applied).

Rindler Metric

(Heads Up: This subsection is in tensor algebra instead because I have no idea how to do this in Geometric Algebra.)

In our own coordinates, the line element is given by the Minkowski metric:

Our job is to convert this line element into Rindler coordinates.

  1. Firstly we calculate where the acceleration is in the direction as always in this section. By the equation , we obtain

  2. Then we substitute this into the line element. A bit of algebra and stuff cancelling out gives us

This completes the derivation of the Rindler metric. Idk what else to say about it; you can calculate distances and angles in the reference frame of the accelerating observer now I guess.

Speed of a Light Beam

The speed of a light beam as calculated relative to some non-inertial reference frame need not always equal . This also has the physical implication that the accelerating observer (if they have a sharp enough eye) would be able to discern some light beams going faster or slower thant .

In the case of constant proper acceleration, the metric calculated above plops out the exact formula we need for the speed of a light beam in the direction:

The second form tells us a nice relation between the Rindler Distance, the coordinate at which speed of light beam was measured, and the measured speed. The speed of a light beam is simple the fraction of the Rindler Distance from the Rindler Horizon scaled by . This means the observer at their own location sees light travel at its usual speed.

It is important to note that these measurements are only for the instant . The actual light beam over time as seen by the accelerating observer would exponentially increase in speed over time due to getting farther and farther away (or exponentially decrease if its going towards the Rindler Horizon).


¹ In most other sources, the spacetime split is usually done by left-multiplication by . That is because they use the weird convention of ordering the timeline bivectors as instead of . No idea why.

² Full credits to @spookyyomo and @theslackline on discord for helping me realize this.